Centralized Multi-Node Repair Regenerating Codes

نویسندگان

Marwen Zorgui

Zhiying Wang

چکیده

In a distributed storage system, recovering from multiple failures is a critical and frequent task that is crucial for maintaining the system’s reliability and fault-tolerance. In this work, we focus on the problem of repairing multiple failures in a centralized way, which can be desirable in many data storage configurations, and we show that a significant repair traffic reduction is possible. The fundamental functional tradeoff between the repair bandwidth and the storage size for functional repair is established. Using a graph-theoretic formulation, the optimal tradeoff is identified as the solution to an integer optimization problem, for which a closed-form expression is derived. Expressions of the extreme points, namely the minimum storage multi-node repair (MSMR) and minimum bandwidth multi-node repair (MBMR) points, are obtained. We describe a general framework for converting single erasure minimum storage regenerating codes to MSMR codes. The repair strategy for e failures is similar to that for single failure, however certain extra requirements need to be satisfied by the repairing functions for single failure. For illustration, the framework is applied to product-matrix codes and interference alignment codes. Furthermore, we prove that functional MBMR point is not achievable for linear exact repair codes. We also show that exact-repair minimum bandwidth cooperative repair (MBCR) codes achieve an interior point, that lies near the MBMR point, when k ≡ 1 mod e, k being the minimum number of nodes needed to reconstruct the entire data. Finally, for k > 2e, e | k and e | d, where d is the number of helper nodes during repair, we show that the functional repair tradeoff is not achievable under exact repair, except for maybe a small portion near the MSMR point, which parallels the results for single erasure repair by Shah et al. Index Terms Regenerating codes, distributed storage, multi-node repair, minimum storage, minimum bandwidth.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hybrid Regenerating Codes for Distributed Storage Systems

Distributed storage systems are mainly justified due to their ability to store data reliably over some unreliable nodes such that the system can have long term durability. Recently, regenerating codes are proposed to make a balance between the repair bandwidth and the storage capacity per node. This is achieved through using the notion of network coding approach. In this paper, a new variation ...

متن کامل

Centralized Repair of Multiple Node Failures with Applications to Communication Efficient Secret Sharing

This paper considers a distributed storage system, where multiple storage nodes can be reconstructed simultaneously at a centralized location. This centralized multi-node repair (CMR) model is a generalization of regenerating codes that allow for bandwidth-efficient repair of a single failed node. This work focuses on the trade-off between the amount of data stored and repair bandwidth in this ...

متن کامل

Repair Strategies for Storage on Mobile Clouds

We study the data reliability problem for a community of devices forming a mobile cloud storage system. We consider the application of regenerating codes for file maintenance within a geographically-limited area. Such codes require lower bandwidth to regenerate lost data fragments compared to file replication or reconstruction. We investigate threshold-based repair strategies where data repair ...

متن کامل

Quasi-cyclic Flexible Regenerating Codes

In a distributed storage environment, where the data is placed in nodes connected through a network, it is likely that one of these nodes fails. It is known that the use of erasure coding improves the fault tolerance and minimizes the redundancy added in distributed storage environments. The use of regenerating codes not only make the most of the erasure coding improvements, but also minimizes ...

متن کامل

Explicit Code Constructions for Distributed Storage Minimizing Repair Bandwidth

Regenerating codes are a class of recently developed codes for distributed storage, that permit data recovery from any k of n nodes, and also have the capability of repairing a failed node by connecting to any d nodes and downloading an amount of data, termed the repair bandwidth, that is on average, significantly less than the size of the data file. These codes optimally trade the storage spac...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1706.05431 شماره

صفحات -

تاریخ انتشار 2017

Centralized Multi-Node Repair Regenerating Codes

نویسندگان

چکیده

منابع مشابه

Hybrid Regenerating Codes for Distributed Storage Systems

Centralized Repair of Multiple Node Failures with Applications to Communication Efficient Secret Sharing

Repair Strategies for Storage on Mobile Clouds

Quasi-cyclic Flexible Regenerating Codes

Explicit Code Constructions for Distributed Storage Minimizing Repair Bandwidth

عنوان ژورنال:

اشتراک گذاری